Cheshire II at INEX: Using a Hybrid Logistic Regression and Boolean Model for XML Retrieval

نویسنده

Ray R. Larson

چکیده

This paper describes the retrieval approach that Berkeley used in the INEX evaluation. The primary approach is the combination of a probabilistic methods using a Logistic regression algorithm for estimation of collection relevance and element relevance, along with Boolean constraints. The paper also discusses our approach to XML component retrieval and how component and document retrieval are combined in the Cheshire II system.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Cheshire II at INEX ’03: Component and Algorithm Fusion for XML Retrieval

This paper describes the retrieval approach that UC Berkeley used in the 2003 INEX evaluation. As in last year’s INEX, our primary approach is the combination of a probabilistic methods using a Logistic regression algorithm for estimation of document (article) relevance and/or element relevance, along with Boolean constraints. This year we also used data fusion techniques to combine results fro...

متن کامل

Xxl @ Inex 2003

Information retrieval on XML combines retrieval on content data (element and attribute values) with retrieval on structural data (element and attribute names). Standard query languages for XML such as XPath or XQuery support Boolean retrieval: a query result is a (possibly restructured) subset of XML elements or entire documents that satisfy the search conditions of the query. Such search condi...

متن کامل

RMIT University at INEX 2005: Ad Hoc Track

Different scenarios of XML retrieval are analysed in the INEX 2005 ad hoc track, which reflect different query interpretations and user behaviours that may be observed during XML retrieval. The RMIT University group’s participation in the INEX 2005 ad hoc track investigates these XML retrieval scenarios. Our runs follow a hybrid XML retrieval approach that combines three information retrieval m...

متن کامل

Cheshire II at GeoCLEF: Fusion and Query Expansion for GIR

In this paper I will describe the Berkeley (group 1) approach to the GeoCLEF task for CLEF 2005. The main technique we are testing is the fusion of multiple probabilistic searches against different XML components using both Logistic Regression (LR) algorithms and a version of the Okapi BM-25 algorithm. We also combine multiple translations of queries in cross-language searching. Since this is t...

متن کامل

Enhancing Content-And-Structure Information Retrieval using a Native XML Database

Three approaches to content-and-structure XML retrieval are analysed in this paper: first by using Zettair, a fulltext information retrieval system; second by using eXist, a native XML database, and third by using a hybrid XML retrieval system that uses eXist to produce the final answers from likely relevant articles retrieved by Zettair. INEX 2003 content-and-structure topics can be classified...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 2002

Cheshire II at INEX: Using a Hybrid Logistic Regression and Boolean Model for XML Retrieval

نویسنده

چکیده

منابع مشابه

Cheshire II at INEX ’03: Component and Algorithm Fusion for XML Retrieval

Xxl @ Inex 2003

RMIT University at INEX 2005: Ad Hoc Track

Cheshire II at GeoCLEF: Fusion and Query Expansion for GIR

Enhancing Content-And-Structure Information Retrieval using a Native XML Database

عنوان ژورنال:

اشتراک گذاری